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Abstract: Underwater environments are quite different from terrestrial environments in 
terms of the communication media and operating conditions associated with those 
environments. In underwater sensor networks, the probability of node failure is high 
because sensor nodes are deployed in harsher environments than ground-based networks. 
The sensor nodes are surrounded by salt water and moved around by waves and currents. 
Many studies have focused on underwater communication environments in an effort to 
improve the data transmission throughput. In this paper, we present a checkpointing scheme 
for the head nodes to quickly recover from a head node failure. Experimental results show 
that the proposed scheme enhances the reliability of the networks and makes them more 
efficient in terms of energy consumption and the recovery latency compared to the 
previous scheme without checkpointing. 
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1. Introduction 

As the amount of underground resources such as known stores of crude petroleum increases, the 
exhaustion of these resources is also being accelerated. Moreover, some countries that are rich in 
natural resources put pressure on other countries by limiting the export volume of their resources. 
Developing alternative types of resources is one solution, but the results so far have not been practical 
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or cost-effective. The ocean, which occupies 70% of the surface of the Earth, has received attention 
related to resource development, not only for its size but also for its potential. 

Underwater Wireless Sensor Networks (UWSNs) are significantly different from Terrestrial 
Wireless Sensor Networks (TWSNs) in many aspects, such as their high latency, high error probability 
of each node, and their high communication cost [1]. UWSNs also have some disadvantages including 
the unavailability of real-time monitoring, limited interaction between an onshore control center and 
the monitoring instruments, and lack of a mechanism to detect failures quickly [2]. These 
characteristics make existing TWSNs unsuitable as UWSNs. Specifically, the major concerns in 
UWSNs are reduced energy use and reliable communication, since the sensor nodes are powered by 
batteries, which are difficult to replace or recharge in underwater environments and have a high error 
rate. Therefore, node clustering, which has been widely studied in UWSNs, is an effective technique to 
improve energy efficiency and to simplify the network architecture [3]. In addition, numerous studies 
have attempted to improve the reliability of the communication based on clustering protocols. 

A cluster-based UWSN is similar to a TWSN. The clustering protocols elect a head node, and other 
nodes transmit sensing data to their head nodes. The head node transmits the collected data to an 
underwater sink that is specially designed to communicate with the surface sink. In its role, the head 
node is more important in UWSNs than other nodes. If a cluster head failure is not accurately detected, 
it will unnecessarily execute a fault recovery process (head node reelection or network re-clustering) 
and thus, waste energy in the sensor network. To avoid this, it is important to be able to accurately 
detect head node failures [4]. 

To reduce latency and improve reliability during the recovery process, we propose a checkpointing 
scheme, which stores the state of the head node and repairs the head node when it fails. The head node 
sends routing information and collected data to the backup nodes that periodically save this 
information. If a head node experiences a transient fault, one of the backup nodes detects the head 
node failure and becomes the new head node. The head node can quickly recover from a transient fault 
by omitting the reelection of a head node and by preventing the loss of the collected information. Even 
if a head node experiences a permanent fault, a backup node becomes the new head node immediately. 

The balance of this paper is organized as follows. Section 2 describes the architecture and 
background knowledge pertaining to underwater sensor networks. In Section 3, we describe some 
related works to improve the reliability of UWSNs. In Section 4, we propose the recovery scheme for a 
head node failure. In Section 5, we evaluate the proposed scheme. Finally, we provide some 
concluding remarks in Section 6. 

2. Underwater Sensor Networks 

2.1. Underwater Channels 

TWSNs use radio frequency waves, but UWSNs rely on physical means like acoustic waves to 
transmit signals. In terms of protocols, the networks stacks of TWSNs are not suitable for UWSNs 
since low bandwidths and large latency result in long end-to-end delays [5]. Acoustic signals have 
unique characteristics. First, acoustic signals tend to have high transmission losses due to attenuation 
and geometric spreading. The attenuation is induced by absorption due to the conversion of the 
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acoustic energy into heat. Geometric spreading means the propagation of sound energy because of the 
expansion of the wave front. Second, it is easy for acoustic signals to be affected by noise. Noise 
includes man-made noise caused by shipping activities and ambient noise caused by natural phenomena 
such as tides, currents, and storms. 

The speed of sound in sea-water is a function of the temperature, salinity and depth, as expressed by 
Equation (1): 

C(D,S,T)= 1448.96 + 4.59T - 5.3 x 1(T 2 T 2 + 2.374 x 10" 4 T 3 + 1.34(S - 35) 

+ 1.63 x 1(T 2 D + 1.675 x 1(T 7 D 2 - 1.025 x 10" 2 T(S - 35) - 7.139 (1) 
x 10 _13 TD 3 

Here, T, S, and D refer to the temperature in degrees Celsius, the salinity in parts per thousand, and 
the depth in meters respectively. This equation is valid within the temperature range from 2 to 30 °C, 
25 to 40 parts per thousand (%o) salinity and depths from 0 to 8,000 m. Figure 1 shows the change in 
the speed of the acoustic signal with the temperature and the salinity if the depth is fixed at 50 m. 
When the temperature of seawater increases, the speed of the acoustic signal also increases steadily. 
The speed shows a dramatic increase if the salinity increases. The effect of the salinity is higher than 
that of the temperature. 

Figure 1. The speed of the acoustic signal. 
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Equation (2) determines the attenuation of the acoustic signal [A(^)] according to the distance (d) 
and the communication frequency (/) in the ocean [6]: Here, d is the distance (m),/is the frequency 
(kHz) and a(f) is the absorption coefficient (dB/km) 

A(d) = da d ,a= W^n/io 

f 2 f 2 (2) 

<f) = 0.11 jL— + 44-J-—+ 2.75 . 10-4/2 + 0 .003 1 ; 

Here, d is the distance (m),/is the frequency (kHz) and a(/) is the absorption coefficient (dB/km) 
Figure 2 shows a graph of this attenuation with the distance and the communication frequency. In 
UWSNs, the available bandwidth (under 50 kHz) is severely limited due to the extreme characteristics 
of the underwater channel. As the distance increases, the attenuation also increases rapidly. 
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Figure 2. The attenuation of the acoustic signal. 




To improve the efficiency of data transmission using the acoustic signal, the design of the network 
protocol stack is important. A protocol stack for UWSNs should manage power and promote cooperation 
among the underwater sensor nodes. The protocol stack consists of physical layer, data link layer, 
network layer, transport layer, and application layer functionalities, as shown in Figure 3. 

Figure 3. Cross layer protocol stack. 
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The security service protects the underwater sensor nodes and their communications. The power 
management plane is responsible for the network functionalities aimed at minimizing the energy 
consumption. The time synchronization service adjusts the local time of each sensor nodes to the 
global time. The localization service is responsible for providing absolute or relative localization 
information to the underwater sensor node, when needed by the protocol stack or by the application. 

2.2. Underwater Platforms 

The hardware includes underwater sensor nodes, surface sinks and mobile Autonomous Underwater 
Vehicles (AUVs), as shown in Figure 4 [7]. Each sensor node is equipped with a processor, memory, 
storage and an acoustic modem. Nodes also include temperature sensor, pressure sensor, a gyroscope 
and a camera to monitor their environments [8]. The sensor nodes communicate with each other using 
a Time Division Multiple Access (TDMA) protocol to avoid signal collisions. The signal propagation 
speed of an underwater acoustic channel is about 1.5 x 10 3 m/s, which is much lower than the Radio 
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Frequency (RF) propagation speed (3.0 x 10 m/s) [9]. The data rate is less than 8,000 bits/s because 
the available bandwidth of underwater acoustic channels is limited and depends on both the 
transmission range and the frequency [10]. These limitations are caused by the absorption of acoustic 
signals that operate below 30 kHz. The features of underwater acoustic channels are a long propagation 
delay, limited available bandwidth and high error probability [11]. 



An autonomous underwater vehicle (AUV) is a vehicle which travels underwater without requiring 
commands from an operator. AUVs include sensors to prevent collisions, propulsion devices to move 
them forward, and batteries to supply power. They collect data from head nodes as they travel among 
head nodes. A surface sink node is attached onto a floating buoy with satellite or RF capabilities to 
transmit data to an onshore control center. It aggregates data sent from head nodes by using the 
acoustic signal. To reduce propagation delay, the surface sink connects to the acoustic modem by wire 
near the bottom of the ocean. The onshore control center manages and monitors the entire network. 

2.2. Architecture 

UWSNs have a number of limitations, such as their harsh environment, high mobility, long 
propagation delays and limited communication bandwidth. Under such conditions, communication 
failures arise more frequently in UWSNs than they do in TWSNs. Therefore, a technique is needed for 
alternating the direct communication between the sensor nodes and surface station. A clustering or 
hierarchical protocol can improve scalability, energy efficiency and the lifetime of the network 



The architecture of underwater wireless sensor networks is presented in Figure 5 [1]. A group of 
sensor nodes is anchored to the bottom of the ocean. Sensor nodes are interconnected to one or more 
head nodes via wireless acoustic communication. Head nodes are devices in charge of relaying data 
from sensor nodes located at the bottom of the ocean to AUVs indirectly or a surface sink directly. 
AUVs dive between the surface and go to the bottom of the ocean. They communicate and relay the 
collected data sent from head nodes to the surface sink to reduce the energy consumption of the head 
nodes and the data transmission latency. The surface sink is equipped with an acoustic transceiver 



Figure 4. Underwater platforms. 
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capable of handling multiple communication instances with AUVs. The surface sink also supports a 
long-rage RF and satellite transmitter to communicate with the onshore control center. The onshore 
control center collects data from the surface sink and manages the underwater network. 

Figure 5. UWSN architecture. 
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3. Related Works 

3.1. Data Transmission 



UWSNs using acoustic signals have limited bandwidth and long propagation delays. There have 
been many studies to improve the reliability of data transmission under these conditions. The Reliable 
Routing and Application-based Scheduling protocol (RRAS) [12], which is a priority scheduling 
approach for multihop topologies, tries to balance between reliability and efficiency at the Media 
Access Control (MAC) layer. To achieve this goal, it utilizes the NACK-retransmission mechanism 
that delays the request of lost packets for an additional retransmission period. It divides time lots by 
the data transmission period and the retransmission period to reduce control frame handshaking and 
improve the throughput. 

In [13], Multiple-path Forward Error Correction (M-FEC) reinforces the existing Multi-Path 
Communication (MPC) in terms of reliability and energy efficiency. In previous MPC scenarios, 
receivers requested the retransmission of the lost packet whenever the original packet was damaged. 
However, error detection and packet reassemble were only done by the destination node in M-FEC. 
M-FEC is more efficient than MPC because retransmission can be eliminated in the intermediate nodes. 

Segmented Data Reliable Transfer (SDRT) based on a hybrid Automatic Repeat Request (ARQ) 
and FEC approach was proposed in [14]. ARQ requires the transmitter to resend the packet in which 
the error has been detected. FEC uses special codes that allow the receiver to detect and correct a 
limited number of errors. This hybrid mechanism supports quick encoding and decoding algorithm 
called Simple Variant of Tornado (S VT) codes to improve the reliability more than the pure ARQ. 

These approaches focused on the reliability of data transmission in the physical or the data link 
layer. They tried to maintain stable communications which is a base condition to collect sensing data. 
However, it is hard to verify the performance of these MAC-based schemes in practical applications 
and this causes compatibility problems. If sensor platforms use different MAC-based schemes, they 
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cannot communicate with each other. Our approach is easy to implement, because the proposed 
scheme is applied to the application layer. 

3.2. Routing Protocols 

The routing protocols determine the path from a source node to a destination node. In [15], a robust 
location based routing scheme was proposed. To achieve the robustness and energy efficiency, a novel 
routing protocol called Hop by Hop Vector Based Forwarding (HH-VBF) was suggested. HH-VBF 
overcomes the limitations of the existing VBF such as a small data delivery ratio and high throughput 
fluctuations through a self-adaption procedure. That means the decision of forwarding a packet to the 
next node is based on hop-by-hop. 

Energy Aware Routing for Real-Time and Reliable Communication (EARQ) [16] provides 
real-time and reliable delivery of packets by considering the energy awareness of the next path. It is 
one of the proactive routing protocols that maintain a routing table on each node. To achieve real-time 
delivery, only paths that may deliver a packet in time are selected. EARQ may send a redundant packet 
through an alternate path to improve the reliability. 

These routing protocols aimed to improve reliability of the packet transmission at the network layer 
by selecting the next proper node. These studies tried to modify routing algorithms of TWSNs 
considering to features of UWSNs. We designed our checkpointing scheme focused on clustering based 
routing algorithms, because the most typical UWSN architecture is based on a hierarchical structure. 

4. Head Node Checkpointing 

4.1. Requirements 

The head node plays an important role in coordinating its cluster and collecting data from its cluster 
members in clustering protocols [17]. When a failure occurs at the head node, member nodes should 
quickly and correctly detect the failure of the head node. If the member nodes are unaware of the 
failure at the head node, they send meaningless data and therefore waste energy. Some member nodes 
determine that a failure of the head node occurs despite the fact that the head node is operating properly. 
This wrong decision leads to unnecessary energy consumption due to recovery and re-clustering of the 
network. Therefore, it is important to reduce recovery latency and prevent the head node reelection 
process so as to prolong the lifetime of the network. 

4.2. System Design 

We propose a checkpointing scheme for the head node in clustering routing protocols to minimize 
the recovery cost and latency. The head node election phase aims to select a stable node as a cluster 
head and give an equal opportunity to every node to evenly distribute the node's energy consumption. 
This phase involves heavy message exchanges among member nodes such as [18]. Each member node 
broadcasts its weight to others [(N - 1) ] and the head node, which has the highest weight, sends their 
ID to member nodes (N - 1). During the head node election step our scheme elects additional backup 
nodes by using the fact that head nodes notify their IDs in the end of head node election step, to 
checkpoint the information of a head node. The state of backup nodes is similar to the head node in 
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terms of the residual energy or their ability to operation. All collected information sent by normal 
nodes to the head node is also saved in backup nodes. The backup nodes periodically detect the state of 
the head node. If the head node has a transient problem, one of the backup nodes that is the closest 
node to the head node replaces the failed head node and serves as a new head node. 

Figure 6 shows an overview of our scheme as the checkpointing mechanism is applied to the head 
node. When the head node operates properly (clusters A and C), backup nodes save only the checkpoint 
information and monitor the state of the head node. In the case of cluster B, the head node cannot carry 
out its tasks when it encounters either a software or hardware problem. A backup node then operates as 
a head node based on the checkpointing information. Through this checkpointing scheme, we can 
prevent information loss caused by failure in the head node, and we can reduce the recovery latency 
caused by the frequent reelection of a head node. 

Figure 6. Overview of the proposed scheme. 
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4.3. Energy Consumption Model 

We use a Markov model to find the minimum number of backup nodes that meets the expected 
reliability of users and the energy analysis model to determine the optimal checkpointing interval. The 
model that is designed to represent the energy consumed during the transmission of data is divided into 
a shallow ocean (depth of water less than 100 m) and a deep ocean (depth of water greater than 100 m) 
model in UWSNs. In this paper, the energy consumption of the data transmission model is based on 
the shallow ocean model [19]. Table 1 shows the notations and functions used in this paper to model 
our system. 

4.3.1. Assumptions 

To simplify our model, we make the following assumptions: 

The reference network model based on [20] 

All nodes know their residual energy 

There is no communication error between two nodes 

All nodes are fixed (i.e., we do not consider node mobility) 

The failure rate (k) is based on a Poisson distribution 
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4.3.2. The Minimum Number of Backup Nodes 

There is a trade-off between reliability and energy consumption. As the number of backup nodes 
increases, reliability increases, but the energy consumption of checkpointing also increases. As a result, 
the lifetime of network decreases. Therefore, we find the minimum number of backup nodes that 
satisfies the expected level of reliability. We apply the Markov model to determine the minimum 
number of backup nodes when the expected reliability is specified by a user or an application designer. 

In [21], there is a special case of the birth-death process that involves the use of continuous -time 
Markov model. Figure 7 shows the state diagram of our model, where the state refers the number of 
failure nodes. 

Figure 7. The state diagram. 
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Table 1. List of notations used in this paper. 



Notation Description 



N 


The number of sensor nodes 


n 


The number of backup nodes 


X 


Failure rate of each node 


n 


Repair rate of backup nodes 


p 


l/fi 


E tx 


The amount of energy to data transmission between two nodes 


S 


The transmission power (dBm) 


d 


Distance between two nodes 


H 


The depth of water 


I 


The intensity of acoustic signal 


Ttx 


Transmission time 


K 


Packet size 


Pk 


The failure probability of head node at the k t h data transmission 


^pre 


The energy consumption of previous scheme (without checkpointing) 


Eckpt 


Energy consumption of the proposed scheme (with checkpointing) 


Ickpt 


Checkpointing interval 


Dpre 


Recovery latency of previous scheme (without checkpointing) 


Dckpt 


Recovery latency of the proposed scheme (with checkpointing) 


s 


The guard time 


dmax 


Distance from the farthest sensor node to the head node 
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If the failure rate of each node (including the head node) is X and the repair rate is |i, the expressions 
for steady-state probabilities are obtained via Equations (3) and (4): 

k-l 

nA(n - i) 
,0 <k <n (3) 

i=0 ^ 

1 

TT 0 = 



yn n k _J±_ (4) 
^k=oP (n _ k)! 



When each node has its own repair facility, such as a watchdog timer with repair rate \i, the 
availability of an individual component is obtained via Equation (5) and the steady-state availability is 
computed via Equation (6): 

11 

Aindiv " " YT~p (5) 

p n n! 

Asteady - 1 " TT n - 1 - jjj (6) 

L k=0 P (n _ k)! 

When A st eady is equal to the expected reliability, |i is equal to the frequency of the watchdog timer. If 
the failure rate of each node, (X), is given, we can define the minimum number of backup nodes (n) 
through Equation (6). 

4.3.3. The Optimal Checkpointing Interval 

Figure 8 shows the time line of the clustering protocols that use TDMA-based scheduling. Each 
round is composed of a setup stage and a steady stage. All clusters are formed during the setup stage, 
and data transmission occurs during the steady stage. A steady stage comprises several frames, and 
each frame is divided into time slots that are assigned to each member node so as to send data to a head 
node without communication interference. There is some guard time (GT: 5) in order to avoid acoustic 
collisions at the head node when two member nodes using adjacent time slots send their data 
message [22]. In this scheme, the head node can calculate the delay and distance to each member node 
through the time-of-arrival approach that measures the round-trip time of an acoustic signal between 
the head node and a member node [19]. 

If the failure rate of each node is X, we define e~ x T , as there is no failure of each node during the 
total data collection time from all member nodes (T). Under this condition, the probability of failure is 
( \ k_1 

P k = (e~ T J (1 — e~ T ) when the head node gathers data from the k th node. In underwater 

environments the energy consumption of data transmission between two nodes is defined by Equation (7): 

E tx = S xT tx xK,S = 27TXHxI (7) 

To compare the energy consumption of our checkpointing scheme with that of an existing scheme, 
we define E pre and E C k pt as in Equations (8) and (9), respectively: 

Epre = Sk=o{(l " Pk) x E tx + P k x E elec }, E elec = { (N — l) 2 + 2(N — 1)} x E tx (8) 
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N-1 



Eckpt = ^{(1 " Pk) x E tx + P k x (N - 1) x E tx } + n x 

k=0 



ckpt 



x E 



tx 



(9) 



In the previous clustering routing protocol, which does not support the checkpointing mechanism, 
when a head node fails, this causes the election of a new head node which broadcasts the remaining 
energy notification messages, finds the member nodes and constructs a routing table [4]. 

Figure 8. TDMA-based data collection. 
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If a head node fails at the fc t h step, the difference between the energy consumption of the previous 
scheme and that of the proposed scheme is defined by Equation (10), as we assume that a head node 
failure occurs only once per round (7): 

k 



P k x E elec > P k x (N - 1) + n x 



ckpt 



> Ickpt > 0 



(10) 



The optimal checkpointing interval is derived through Equation (11). The minimum value of I C k pt 
that satisfies this condition is optimal. 

nk 

'ckpt > ( N _ 1)2 + ( N _ i) ( n ) 

4.3.4. The Recovery Latency 



As the recovery latency is in direct proportion to the number of required messages, we compare the 
recovery latency of the checkpointing scheme with that of previous schemes, as shown in Equation (12): 

n-1 



»»re = y^7:+ 2 x (^) + (n+l)x 5 
pre Z-.1500 V1500/ 

k=0 ^ 

Dckpt = Ickpt + + 6 



(12) 



The acoustic signal is propagated at 1,500 m/s underwater [23]. Each sensor node in a cluster sends 
collected data to the head node via an assigned time slot based on TDMA. The guard time (S) is used 
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to avoid collisions with the acoustic signal, and it is inserted at two consecutive time slots. The 
recovery latency is related to the number of messages and one of the member nodes. If the number of 
messages that include the failure notification packet and the control packets to reelect a new head node 
increase the recovery latency also increases. Increasing the number of member nodes affects the 
latency in the same way. The data transmission and reelection process are completed when the farthest 
node from the head nodes receives the acknowledgement that is used to confirm the success of data 
transmission between the two nodes. 

5. Performance Evaluation 

5.1. Experimental Environment 

We evaluate our scheme in terms of energy efficiency and recovery latency. Table 2 describes the 
parameters used for the evaluation. The value of the parameters is based on [24], a study that investigated 
the transmission power and the communication latency in UWSNs. 

Table 2. Parameters for the simulation. 



Notation Description 

N 10<Af<50 

N 3 

H 50 m 

I 0.061 dB 

T tx 280 ms 

K 64 bytes 

A 1.35 x 10" 6 (0<A< 1.0) 

M 2.7 x 10" 6 (0<A< 1.0) 

P 0.5 (Mfi) 

Ickpt 25 s 

S 500 ms< S < 1,000 ms 

d max 15 km (the maximum transmission range of acoustic modem = about 20 km) 



5.2. Checkpointing Interval 

Figure 9 shows the steady-state availability of our scheme via the number of backup nodes where 
p (?i/|i) is Equation (6). If the rate of the watchdog timer is higher than the failure rate (p < 1), the 
reliability of our system is greater than 80% when using three backup nodes. If the repair rate is 
identical to the failure rate (p = 1), our system maintains a reasonable level of availability (more than 
73%) using just three backup nodes. Therefore, we determined the minimum number of backup nodes 
as 3. 

We simulated two schemes using MATLAB [25], a well-known simulator for verifying numerical 
models. We implemented our checkpointing scheme based on a clustering-based routing protocol. As 
shown in Figure 10, when the number of member nodes in a cluster increases in a range from 10 to 50, 
the average energy consumption of our checkpointing scheme is less than that of the previous scheme 
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(without checkpointing). Our scheme is also superior to the previous scheme in terms of scalability, as 
the energy consumption of the proposed scheme steadily increases with the number of nodes in a cluster. 

Figure 9. The steady-state availability. 




Figure 10. Comparison of the energy consumption values. 
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The energy consumption of the scheme without checkpointing is higher than that of our scheme, 
and the difference between the two schemes steadily increases as the number of nodes in a cluster 
increases. With this extra energy, our scheme reduces the checkpointing interval and increases the 
reliability of the underwater sensor network. In this case, we derived optimal checkpointing intervals 
of between about 160 s and about 30 s when the number of sensor nodes ranged from 10 to 50 in 
Figure 1 1 . The results show that as the number of sensor nodes increase, the amount of extra energy 
(E pre - E ckpt ) increases, as does the amount of checkpointing messages. In summary, the optimal 
checkpointing interval approaches 20 s as the number of sensor nodes in a cluster increases. 

To verify the proposed energy consumption model, we simulated our scheme under the uniform 
distribution of a node failure rate. When the number of nodes is 10 and 20 respectively, the energy 
consumption of each scheme is shown as Figure 12. In case of previous scheme, without checkpointing, 
as the number of nodes increases, the energy consumption increase exponentially and the variation in 
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each round is also fluctuating, but in the case of the proposed scheme, with checkpointing, the 
affection of the number of nodes is limited and the variation in each round is also small. 

Figure 11. The checkpointing interval. 




25 30 35 
The number of nodes 



Figure 12. Comparison of the energy consumption values (randomized). 
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5.3. Recovery Latency 

To compare the recovery latency between the scheme with checkpointing and that without 
check-pointing, we deployed sensor nodes randomly, meaning that we assigned the distance between a 
head node and each member node (ranging from 1 to 15 km) arbitrarily. The recovery latency is affected 
by the number of messages sent during the recovery process. In the scheme without checkpointing, 0(n 2 ) 
messages are generated during the re-election process as the number of nodes increases in a cluster. 
However, our checkpointing scheme when applied generates only 0(n) messages via the backup node; 
thus, the recovery latency with checkpointing increases linearly, as shown in Figure 13. 

Figure 14 shows the variation of recovery latency as the guard time increases range from 500 ms to 
1,000 ms. The guard time that prevents the collision of acoustic signals affects to the recovery latency 
directly. The recovery latency with checkpointing scheme is more efficient than that of without 
checkpointing scheme even if the guard time is up to 50 s. 



Sensors 2012, 12 



1208 



Figure 13. The recovery latency comparison. 
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Figure 14. The recovery latency as the guard time (<5). 
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6. Conclusions 



In the design of energy efficient underwater sensor networks, it is necessary to consider the resource 
constraints of the sensor nodes and the underwater environment. Underwater sensor networks operate 
in harsher conditions than TWSNs. The applications of UWSNs are also expanding into areas that 
represent harsher and generally more dangerous environments. Therefore, the reliability of the network 
is considered to be of upmost importance. 

In this paper, we proposed a checkpointing scheme for clustering routing protocols. Through our 
simulation results, we showed that the proposed checkpointing scheme results in an UWSN that is 
more energy efficient and has lower recovery latency when the head node fails than the previous 
schemes. 
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